# Are Voters' Preferences Being Ignored in Governor's Agendas?
## Benjamin S. Noble and Daniel M. Butler

## Table of Contents

### Analysis
- `20260130-analysis-code-final.R` creates all tables and figures.

### Data
- `sots-df.csv` speech-level data for ~2,400 State of the State addresses (1962-2023), including partisan slant measures, governor covariates, and classification results.
- `sots_sents_df_small.csv` sentence-level data with predicted partisan probabilities from fine-tuned BERT models.
- `cumulative_2006-2024.dta` Cooperative Election Study (CES) cumulative data file, used for governor approval analysis. Available from https://cces.gov.harvard.edu.
- `state_policy_liberalism.dta` state policy liberalism estimates from Caughey and Warshaw.
- `gov-congress-nominate.csv` governor NOMINATE scores matched from prior congressional service.
- `gov-nominate-agg.csv` aggregated governor-level NOMINATE scores for validation.
- `gov_elex_results.csv` governor election results, 1960-2023.
- `missing-gov-party.csv` governor party information for state-years missing from the speech corpus.
- `census-regions.csv` U.S. Census region classifications by state.
- `perm-test.df` permutation test results for classifier validation.
- `sots_eras.csv` speech-level data from era-specific BERT models (appendix).
- `ra-codes/` directory containing research assistant hand-coding data for pairwise validation of the partisan slant measure:
    - `pairwise_ra_codes_ans_20251216.csv` true pairwise comparison answers.
    - `pairwise_ra_codes_20251216-s.csv` RA 1 pairwise codes.
    - `n_pairwise_Ra_codes.csv` RA 2 pairwise codes.
- `cumulative_2006-2024.dta` the CES cumulative file (https://cces.gov.harvard.edu/explore)

### Misc
- `ggtheme_baselike.R` custom ggplot theme file used for plot formatting.
- `readme.txt` (this file)

## Computing Environment
R Version: 4.5.2 (2025-10-31)
Machine: Bens-MacBook-Pro.local arm64
Operating System: Darwin 25.2.0 Darwin Kernel Version 25.2.0
Base Packages: stats graphics grDevices utils datasets methods base
Other Packages: ngram digest marginaleffects fixest haven ggridges quanteda stm modelsummary kableExtra tidyverse (lubridate forcats stringr dplyr purrr readr tidyr tibble ggplot2)
